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Massive galaxy clusters are now found as early as ~ 3 billion years after the 
Big Bang, containing stars that formed at even earlier epochs.!'? The high- 
redshift progenitors of these galaxy clusters, termed *protoclusters', are identi- 
fied in cosmological simulations with the highest dark matter overdensities.^* 


While their observational signatures are less well defined compared to virial- 
ized clusters with a substantial hot intra-cluster medium (ICM), protoclusters 
are expected to contain extremely massive galaxies that can be observed as lu- 
minous starbursts. Recent claimed detections of protoclusters hosting such 
starbursts*-!! do not support the kind of rapid cluster core formation expected 
in simulations" because these structures contain only a handful of starburst- 
ing galaxies spread throughout a broad structure, with poor evidence for even- 
tual collapse into a protocluster. Here we report that the source SPT2349-56 
consists of at least 14 gas-rich galaxies all lying at z = 4.31 based on sensitive 
observations of carbon monoxide and ionized carbon. We demonstrate that 
each of these galaxies is forming stars between 50 and 1000 times faster than 
our own Milky Way, and all are located within a projected region only — 130 
kiloparsecs in diameter. This galaxy surface density is more than 10 times the 
average blank field value (integrated over all redshifts) and >1000 times the 
average field volume density. The velocity dispersion (~ 410 km s^!) of these 
galaxies and enormous gas and star formation densities suggest that this sys- 
tem represents a galaxy cluster core at an advanced stage of formation when 
the Universe was only 1.4 billion years old. A comparison with other known 
protoclusters at high redshifts shows that SPT2349-56 is a uniquely massive 
and dense system that could be building one of the most massive structures in 
the Universe today. 


In a multi-band survey over 2500 deg? of sky, the South Pole Telescope (SPT) discovered a 
population of rare (n ~ 0.04 deg ?), extremely bright (S44,,, > 20mJy) millimeter-selected 
sources. ^^ The Atacama Large Millimeter Array (ALMA) 870-um imaging showed that 
more than 9096 of these SPT-selected sources are single high-redshift submillimeter galaxies 
(SMGs)? with intrinsic flux densities of Sg7o 4m = 5 — 10 mJy, but gravitationally lensed by 
factors of 5 — 20,!° with a median redshift z ~ 4." However, ~ 10% of these sources show no 
evidence for lensing and may instead be intrinsically very luminous galaxies or even groups of 
multiple rapidly star-forming galaxies. The brightest such source in the SPT 2500 deg? survey, 
SPT2349-56 (5, 4,,,223.3 mJy), is revealed by LABOCA (a low resolution bolometer cam- 
era on the APEX telescope) observations at 870 jum to consist of two elongated sources with a 
combined flux density S870 pm ~ 110 mJy (Fig. 1), with the brighter southern source comprising 
~ 77 mJy of this flux density. An ALMA redshift survey!” further resolved SPT2349-56 into a 
pair of bright 3-mm sources associated with the southern LABOCA source, with both lying at 
z = 4.3. 

To better understand the nature of this structure, deep ALMA spectral imaging of the 
brighter southern peak of the extended LABOCA source was undertaken. A 358-GHz map 
containing the redshifted [CII]1900.5 Guz line was used to search for line-emitting galaxies. A 
blind spectral line survey (described in the Methods) was performed on the data cube, revealing 
14 z ~ 4.31 line emitters at high significance (SNR >7). Twelve of these emitters are indi- 
vidually detected in the 1.1-mm continuum map at 5o, with 1.1-mm flux densities ranging 
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from 0.2-5 mJy (Fig. 1). The remaining two line emitters (M,N) are both detected at lower sig- 
nificance in the 1.1-mm continuum map but have robust IRAC infrared counterparts (Extended 
Data Table 1, Extended Data Fig. 4). Nine of these sources are also detected (> 5c) in the 
CO(4-3) line. The ALMA spectra are shown in Fig. 1. 

The measurements of both the continuum and spectral lines of the 14 galaxies allow us 
to estimate their star formation rates (SFRs), gas masses, and dynamical masses (Tables 1 & 
Extended Data Table 1). The physical properties of these sources indicate that this protocluster 
already harbors massive galaxies that are rapidly forming stars from an abundant gas supply. 
The two brightest sources, A & B, have SFRs in excess of 1000 solar masses per year within 
their resolved ~ 3-kpc radii (Extended Data Table ). The total SFR of the 14 sources is 6000 + 
600 Mo yr^!. Multi-colour imaging with Herschel-SPIRE (250, 350, 500 jum), in addition to 
the 870-um LABOCA map, shows that the northern LABOCA structure is also consistent with 
lying at z = 4.3 (see Methods). The sources detected in the ALMA 870-um imaging therefore 
comprise just 50% of the total flux density of the southern LABOCA source and 36% of the total 
LABOCA flux density, suggesting that the inner ~ 500 kpc of this protocluster contains 16,500 
Mo yr lof star formation. Modelling the spectral energy distribution based on this combined 
submillimeter photometry yields an IR luminosity (from 8-1100 um) of (8.0 + 1.0) x 1013 Lo. 

The gas masses of the 14 protocluster galaxies, estimated from CO(4-3), or [CII] if unde- 
tected in CO(4-3) (see Methods), range from 1 x 10'° to 1 x 10!! Ms, with a total gas mass 
of ~ 6 x 10!! (Xco/0.8) Mo. A follow-up survey of colder molecular gas in CO(2-1) with 
the ATCA radio telescope detects the bulk of this large gas repository, especially in the central 
region near sources B, C, & G, and confirms that the assumptions about gas excitation used to 
convert CO(4-3) to Hz gas masses of the galaxies are reasonable. Based on simulations!? and 
measurements of lower-redshift systems that have a similar gravitational potential well depth, 
we expect and calculate explicitly in the Methods that the cold gas may comprise only a small 
fraction of the baryon budget. The bulk of the baryons may already be in the form of a dif- 
fuse, hot gas filling the space between the galaxies — the ICM that is characteristic of massive 
virialized galaxy clusters at z « 1.5. 

The detected ALMA sources also enable an initial estimate of the mass of the protoclus- 
ter. We determine the mean redshift using the biweight estimator? as (z),, = 4.3040*005:9- 
The velocity dispersion of the galaxy distribution is cy; = 408734 km s^! according to the bi- 
weight method, ? which is the standard approach for galaxy samples of this size. Other common 
methods (gapper,'? Gaussian fit) agree to within 3% and provide similar errors. Under the as- 
sumption that SPT2349-56 is approximately virialized, the mass-dispersion relation for galaxy 
clusters’? indicates a dynamical mass of May, = (1.16 + 0.70) x 10? Mo, which is an upper 
limit if the system has not yet virialized. The total halo mass indicates that the protocluster is 
a viable progenitor of a > 10!? Me galaxy cluster comparable to the Coma cluster at z = 0 
(Fig. 2).? The location of SPT2349-56 in this plane suggests a very massive descendant, but 
we caution that N-body simulations indicate that it is difficult to reliably predict z = 0 halo 
mass from the halo mass at a given epoch due to the large halo-to-halo variation in dark matter 
halo growth histories.?! 
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To study the relative overdensity and concentration of SPT2349-56, it is desirable to com- 
pare with other active protoclusters at high redshift. SPT2349-56 is highly overdense, as it 
harbors 10 SMGs with 5; ;,,4, Z 0.5 (a level at which we are complete, with uniform sensi- 
tivity across our search area) located within a circle of diameter 19" (130 kpc), corresponding 
to a number density of N(S; 1, > 0.5 mJy) z 2 x 10* deg ?. By comparison, the average 
number of field sources with 5,44, > 0.5 mJy within this area across all redshifts is less than 
one;? thus, this field is overdense by more than a factor of 10. When we account for the fact 
that all sources are at the same redshift, the volume density is > 1000x the field density, as- 
suming a redshift binning of Az=0.1 and the redshift distribution for SMGs.”* In Fig. 2, we 
plot ‘curves of growth’ of the total 870-um flux density versus on-sky area for SPT2349-56 
and other SMG-rich protoclusters (see Methods for the details of the comparison sample). For 
SPT2349-56, we plot both the total flux density of the 14 confirmed protocluster members de- 
tected with ALMA and the total flux density of the extended LABOCA structure. The curve 
of growth for SPT2349-56 rises much more steeply than those of the other high-redshift proto- 
clusters, demonstrating its extreme density. For SPT2349-56 the on-sky area encompassing the 
accumulated 870-um flux density (and thus approximately the total SFR) is as much as 3 orders 
of magnitude less than for other protoclusters at z > 2. SPT2349-56 clearly stands out as the 
densest collection of SMGs: although some other protoclusters contain as many SMGs, they 
extend over much larger areas on the sky, with separations often exceeding 10 arcmin (800 to 
1400 cMpc at z — 4.3 to z — 2). This comparison demonstrates that SPT2349-56 is likely ob- 
served during a significantly more advanced stage of cluster formation than other high-redshift 
protoclusters, a cluster core in the process of assembly rather than an extended structure that 
may not even collapse to form a cluster by the present day.'* 

Also shown in Fig. 2 is the maximal curve of growth predicted by a theoretical model 
for submm-luminous protocluster regions at z ~ 4.5 (see Methods for details). Except for 
SPT2349-56 and the recent Herschel discovery SMM J004224,”* the comparison high-z pro- 
toclusters exhibit S79 ,,.m curves of growth fairly consistent with the model expectations. The 
model prediction for the region spanned by SPT2349-56 is ~ 10% of the observed total flux 
density of the 14 ALMA sources. The under-prediction is more severe if we consider the ex- 
tended LABOCA source: only ~ 5% of the observed flux density is recovered. This dis- 
crepancy may suggest that environmental effects (such as enhanced galaxy interactions or gas 
accretion in high-density environments) that are not included in the theoretical model employed 
are responsible for the extremely high SFR density exhibited by SPT2349-56. An alternative 
theoretical approach, ‘zoom’ hydrodynamical simulations of protoclusters,? can potentially 
capture such environmental effects, but to date, such simulations have been unable to reproduce 
the extremely high SFR inferred for SPT2349-56: of the 24 protocluster simulations presented 
by these authors, the maximum total SFR attained was ~ 1700 Mo yr~*, an order of magnitude 
less than that of SPT2349-56. However, the volume of the N-body simulation from which the 
24 halos were selected was 1 h~* cGpc?, which may be too small to contain an object as rare 
as SPT2349-56. Nevertheless, the existence of SPT2349-56, which contains an unprecedented 
concentration of rapidly star-forming SMGs when the Universe was only 1.4 Gyr old, poses a 
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formidable challenge to theoretical models seeking to explain the origin and evolution of galaxy 
(proto)clusters. 

SPT2349-56 may represent a significantly more advanced stage of cluster formation than 
the typical z > 4 protoclusters identified to date, as outlined above. Since the cores of present- 
day galaxy clusters are characterized by massive elliptical galaxies with old-to-intermediate-age 
stellar populations? and SMGs are thought to be the high-redshift progenitors of present-day 
ellipticals,? it is likely that the 14 SMGs located at the same redshift within a region < 130 
kpc in diameter will soon merge to form a massive elliptical galaxy at the core of a lower- 
redshift galaxy cluster. This can be quantified by considering the total energy per unit mass 
of the system E = 1/2v? + ® (km s !)?. The total energy is negative for the 14 sources, 
assuming the individual halo masses are as little as >2 x the masses implied by their central line 
widths, a condition that is easily met for any local galaxies." Theoretical studies have shown 
that at z > 4, the progenitors of galaxy clusters should span > 5 comoving Mpc (cMpc), ^? 
corresponding to an angular scale of as much as a degree; we are thus possibly observing only 
a small part of a much larger structure. For SPT2349-56, it is unknown whether the overdensity 
extends over such a large scale, as more detailed observations are required to characterize the 
field surrounding SPT2349-56. We have demonstrated that the extended LABOCA-detected 
complex has submm colours similar to the core region identified by our ALMA observations and 
is thus likely all at z ~ 4.3. We have also identified five additional bright SPIRE sources in the 
surrounding ~ 800 x 800 cMpc field with similar red colours lying several arcmin from the core 
structure (see Methods). These are candidates for additional protocluster members located in an 
extended, collapsing structure, similar to the comparison SMG overdensities shown in Fig. 2. 
If all these sources are confirmed to lie at z — 4.31, this would approximately double the far-IR 
luminosity of the cluster, making it by far the most active system known in the Universe. Since 
SPT2349-56 was selected from a blind mm survey of 2500 deg? (approximately 1/16th of the 
sky), it is unlikely there are more than approximately 16 such structures across the entire sky. A 
full analysis of other unlensed sources from the SPT survey to identify possible systems similar 
to SPT2349-56 will place stronger constraints on early structure formation in the Universe. 
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Methods 


1 Observations 


1.4 SPT, LABOCA, and Herschel discovery and ALMA follow-up 


The South Pole Telescope?! (SPT) possesses a unique combination of sensitivity, selection 
wavelengths (3, 2, and 1.4 mm), and beam size that potentially make it ideal for finding the ac- 
tive core regions of galaxy clusters forming at the earliest epochs. Finding very distant (z > 4), 
gravitationally lensed millimetre sources in the SPT survey is relatively straightforward, where 
the contrast to such distant bright sources is high relative to the weak (generally undetected) 
galactic foregrounds (Extended Data Figure 1). However searching for the rare SMGs in the 
SPT 2500 deg? survey that are unlensed, and therefore candidates for active groups and proto- 
clusters like SPT2349-56, involves sifting through the many gravitationally lensed sources, and 
typically involves multi-stage follow-up efforts using various facilities: a single dish mapping 
instrument like APEX-LABOCA to better localize the emission within the ~ 1' SPT beam, deep 
optical imaging to search for bright lensing galaxies, and high resolution ALMA mapping. The 
spatially extended sources in SPT2349-56 found with LABOCA span more than an arc min.. 
With deep upcoming surveys using the next generation SPT-3G receiver, this 'extended-beam' 
thermal source structure may present a unique signature of many early forming protoclusters, 
affording the first complete census in the early epochs of structure formation. 

A shallow, wide field SPIRE image over a 100 deg? subregion of SPT-SZ? reveals the red 
colours of SPT2349-56, and that SPT2349-56 appears to reside in something of a void in the 
z ~ ] foreground that dominates the SPIRE galaxy population. However the high redshift of 
SPT2349-56 means that it is not significantly brighter than many other SPIRE sources in this 
field, and aside from its colours, SPT2349-56 does not stand out substantially from the field 
despite its extreme properties. SPT2349-56 is not detected in the all sky Planck survey,’ the 
lower sensitivity of Planck compared with SPT being exacerbated by beam dilution in the 3’ 
beam. 

Obtaining the redshift for SPT2349-56 was beyond the scope of the original SPT-SMG 
redshift survey, due to the faintness of the unlensed components relative to the typical bright, 
gravitationally lensed SMGs found in the bulk of the SPT-SMG sample. In ALMA Cy 1, 
SPT2349-56 was included in the 3-mm spectral scan redshift survey,'>*? but no lines were 
detected in the short ~ 1min integrations with 16 ALMA antennae. In Cy 3, a deeper follow-up 
3-mm spectral scan was able to tentatively identify two CO lines and a double source structure 
with a likely redshift z = 4.30, confirmed by APEX/FLASH C^ detection." 
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11.1 APEX-LABOCA 


We obtained 870-um imaging of SPT2349-56 using LABOCA on the APEX telescope. A 
shallow image with 1.6hr integration time was observed on 27 Sep 2010 reaching — 5 mJy/beam 
rms. In August 2017 we obtained a deeper image (18.8h integration time, Project ID: E-299.A- 
5045A-2017) reaching a reaching a minimum noise level of 1.3 mJy/beam and « 2.0 and « 
1.5 mJy/beam rms for 75.3 and 32.4 sq arcmin, respectively (shown in Figure 1 & Extended 
Data Figure 2). All observations were carried out using standard raster-spiral observations?! 
under good weather conditions (PWV of 0.6 mm and 0.8 mm for the 2010 and 2017 observing 
campaigns, respectively). Calibration was achieved through observations of Uranus, Neptune 
and secondary calibrators and was found to be accurate within 8.5% rms. The atmospheric 
attenuation was determined via skydips every 2hr as well as from independent data from the 
APEX radiometer which measures the line of sight water vapor column every minute. The data 
was reduced and imaged using the BoA reduction package.? LABOCA's central frequency and 
beam size are 345 GHz and 19.2", resolving the SPT 1.4-mm elongated source into two bright 
LABOCA sources. 

Both LABOCA observations yield consistent calibration results with peak intensity at 21" 


resolution of 50 mJy/beam for the brighter, southern component (RA 23:49:42.70, DEC -56:38:23.4). 


In addition the LABOCA map reveals a second source to the north at RA: 23:49:42.86, DEC: - 
56:37:31.02 with a peak flux density at 21" resolution of 17 mJy/beam. Both sources are clearly 
extended even at LABOCA’s relatively coarse spatial resolution with deconvolved source size 
of 18" x 12" and 32" x 5" for the sourthern and northern source, respectively. These components 
are connected by a faint bridge emission. The total 870-um flux density of the SPT2349-56 sys- 
tem is 110.0+9.5 mJy, of which ~77 mJy are associated with the southern component, ~25 mJy 
with the northern component, and ~7 mJy with the connection between the components (using 
the sub apertures shown in Extended Data Figure 2). One additional submm source is detected 
at > 5c in the LABOCA image to the east of the primary source, but having blue colours 
inconsistent with z ~ 4, and not likely being a member of the extended protocluster. 


12 ALMA 


Observations using ALMA Band-3 targeted the CO(4-3) line in SPT2349-56 centred in the 
lowest frequency of the spectral windows adopted (86-88 GHz), taken under a Cycle 3 program 
2015.1.01543.T (PI: K. Lacaille). Data was taken on June 24th, 2016 with a 47 min integration 
time. The array used 36 antennas with baselines ranging from 15 to 704 m, and provided a 
naturally weighted synthesized beam size of — 1". Pallas and J2343-5626 were used to calibrate 
the flux and phase respectively. Data was processed using the standard ALMA pipeline using 
natural beam weighting. 

ALMA Band-7 imaging (276 GHz) were obtained under a Cycle 4 program (2016.0.00236.T; 
PI: S. Chapman) targeting the peak of the brightest LABOCA source. Observations were ob- 
tained on December 14th, 2016 in a 40-2 array configuration with baseline lengths of 15-459 m, 
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giving a naturally weighted synthesized beam size of ~ 1". There were 40 antennas available, 
with total on source integration time of 22 minutes. Ceres and J2357-5311 were used as flux 
and phase calibrators respectively. The [CII] line (vrest = 1900.5 GHz) was observed at as part 
of the same ALMA project on March 23rd, 2017, tuning in Band 7 to the redshifted line at 
Vons = 358.3 GHz in the upper sideband covering 356 to 360 GHz. These observations used the 
40-2 array configuration with baselines of 16-459 m, giving a naturally weighted synthesized 
beam size of ~ 0.5". An on-source integration time of 14 min was obtained, and J2357-5311 
was used as both the flux and phase calibrator. The data were re-processed using CASA and the 
standard ALMA-supplied calibration using natural beam weighting to maximize sensitivity. 

One dimensional spectra are extracted from the centroid of the line emission for each source 
and binned into 75 km s^! channels. Spectra are presented in Figure 1, and are smoothed using 
a Gaussian filter with FWHM = 100 km s^! for presentation. A Gaussian line profile is fit using 
a least-squares method, providing errors to the velocity offsets from z — 4.300 in Table 1 and 
line widths in Extended Data Table 2. The continuum level is left as a free parameter in the 
fitting function which is then subtracted to derive line fluxes. 


1.2.1 Blind search for [CII] 


We performed a blind search for [CII] line emission in the ALMA band 7 data cube toward 
SPT2349-56. For this, we follow the procedure used to detect line emitters in the ASPECS 
survey. We use a data cube channelized at 100 km/s, without primary beam correction and 
continuum subtraction. 

We used the Astronomical Image Processing System (AIPS) task SERCH. This task con- 
volves the data cube along the frequency axis with a Gaussian kernel defined by different input 
linewidths, subtracts surrounding continuum, and reports all channels and pixels that have a 
signal-to-noise ratio (SNR) over a specified limit. The SNR is defined as the maximum sig- 
nificance level achieved after convolving over the Gaussian kernels. We used a set of different 
Gaussian kernels, from 200 to 600 km/s and searched for all line peaks with SNR>4.0. 

Once all peaks were identified, we used the IDL routine CLUMPFIND*’ to isolate indi- 
vidual candidates. A full list of 68 positive line peaks with SNR>4.0 were thus obtained. We 
quantified the reliability of our line search based on the number of negative peaks in our ALMA 
cube, using the same line procedure. We find 43 negative peaks with SNR «5.8. This means 
that all positive line candidates with SNR 76.0 are likely real (100% purity). Out of the 14 [CII] 
line candidates detected, all have SNR>6.3 and 13 are associated with continuum detections in 
the ALMA data. 


1.3 ATCA CO(2-1) 
1.3.1 Observations 


We used the Australia Telescope Compact Array (ATCA) in its H168 hybrid array configuration 
to observe the CO(2-1) emission line (Vrest = 230.5380 GHz) toward SPT2349-56 (with a pri- 
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mary beam size of 53"). The observations were performed as part of project ID C2818 during 
2016 October 2,3 and 11 under good weather conditions (atmospheric seeing values 90-400 m) 
with five working antennas. 

We used the ATCA 7-mm receivers, with the Compact Array Broadband Backend config- 
ured in the wide bandwidth mode.?? This leads to a total bandwidth of 2 GHz per correlator 
window and a spectral resolution of 1 MHz per channel (6.9 km/s per channel). The spectral 
windows were centred at observing frequencies of 43.5 and 45.0 GHz, and aimed at observing 
the CO line and continuum emission, respectively. 

Gain and pointing calibration were performed every 10 min and 1 h, respectively.The bright 
sources 1921-293, 1934-638 and 2355-534 were used as bandpass, flux and gain calibrators, 
respectively. We expect the flux calibration to be accurate to within 15 per cent, based on 
the comparison of the Uranus and 1934-638 fluxes. The software package MIRIAD?? and the 
Common Astronomy Software Applications (CASA?) were used for editing, calibration and 
imaging. 

The calibrated visibilities were inverted using the CASA task CLEAN using natural weight- 
ing. No cleaning was applied given the relatively low significance of the CO line detection in 
individual channels. The final data cube, averaged along the spectral axis, yields an rms of 
0.23 mJy beam! per 100 km/s channel with a synthesized beam size of 5.6" x4.5" (PA=70.4 
deg) at 43.5 GHz. 


1.3.2 Results 


One source formally detected at the centre, which corresponds to CII/continuum sources B+C+G. 
This central CO source (C) is unresolved at the resolution of the ATCA observations. Other two 
sources are marginally detected to the East (E) and North (N) of the central source, coinciding 
with the location of ClI/continuum sources D+E and A+K, respectively. We extracted spectra 
at these locations and obtained integrated line intensities, by fitting Gaussian profiles to the 
identified line emission. 

We compute CO luminosities using the integrated line intensities and compute gas masses 
by assuming a ULIRG Xco factor of 0.8 (Mo (K km/s pc?) !) and that the CO gas is in local 
thermodynamic equilibrium thus L(coa-1L(01-0) The results of the CO line observations 
are summarized in Extended Data Table 2. Collapsing the line-free spectral window along the 
spectral axis over the 2-GHz bandwidth, leads to a non-detection of the continuum emission 
down to 80 uJy/beam (30). 

These results confirm the finding from CO(4-3) line that that the main reservoir (7296) of 
molecular gas resides in the B+C+G system, with a smaller fraction hosted at the East and North 
locations. 
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1.4 Spitzer imaging 


This field was twice observed at 3.6 and 4.5 um with the Infrared Array Camera (IRAC?) on 
board the Spitzer Space Telescope.? It was first observed in 2009 August as part of a large 
program to obtain follow-up imaging of a large sample of SPT-selected SMGs sources (PID 
60194, PI Vieira). The observing scheme used for PID 60194 was to obtain 36 dithered 100 sec 
integrations at 3.6 um and, separately, a much shallower 12 x 30sec integration at 4.5 um. 
Later, in Cycle 8, the field was covered serendipitously as part of the Spitzer-SPT Deep Field 
survey (PID 80032, PI Stanford; Ashby et al. 2013). PID 80032 surveyed 92 deg? uniformly in 
both IRAC passbands to a depth of 4 x 30 sec. Using established techniques, we combined all 
exposures covering the SPT target from PID 60194 and 80032 at 3.6 and 4.6 um to obtain the 
best possible S/N in our final mosaics, which were pixellated to 0.5". Nine of the 14 sources 
identified by ALMA are detected in the IRAC bands at > 3c in at least one of the 3.6 or 4.5 um 
channels, as shown in Extended Data Figure 4. 


1.5 Analysis of the surrounding field with SPIRE and LABOCA imaging 
In Extended Data Figure 5, our deep SPIRE RGB image is shown with LABOCA contours over- 


laid. A source sample is culled from the 250 jum-selected catalog (135 sources with SNR(250 wm)>3 


in an area of 52 arcmin?), where the source peaks are best defined. To account for the large beam 
size difference with SPIRE (ranging from 36" at 500 um to 18" at 250 um), we employed a de- 
blending code, using the 250 um positions as spatial priors, which provides the standard param- 
eters as well as the covariance matrices highlighting the degeneracies (almost none at 250 jum, 
but significant at 500 um). The code, FASTPHOT,* takes into account these degeneracies to 
estimate the flux measurement errors. 

Colour-colour (CC) and colour-flux (CF) diagrams are shown in Extended Data Figure 5. 
The CC diagram shows a 250 jum-selected sample with SNR(250 um)>3 and is dominated by 
the z ~ 1 cosmic infrared background (blue, green colours) in the foreground of SPT2349- 
56. The CF diagram shows an additional SNR(500 ~m)>3 cut to highlight just the well de- 
tected 500 um source sub-sample. These diagrams highlight the extreme and red properties 
of SPT2349-56, but make clear that one of the three 250 wm-peaks within the SPT2349-56 
LABOCA structure is very likely a foreground galaxy (green symbol highlighted in the figure 
shows very blue colours). Nevertheless, a full ALMA mapping of the structure is warranted 
given the uncertainties involved in the SPIRE deconvolution procedure. 

Five red sources consistent with z ~ 4 (S599,4, > S350um > S250m) are found in the 
surrounding ~ 10'x 10' field and are candidates for additional protocluster members in an ex- 
tended, collapsing structure. If all these sources were bona fide z = 4.3 sources, this would 
significantly increase the total 870-j.m flux density (and thus the far-IR luminosity) of the clus- 
ter beyond the 110 mJy found in the central structure, making it by far the most active system 
known in the Universe (see Figure 2).The deep LABOCA map marginally detects the closest of 
the five red SPIRE sources at ~ 3c, consistent with expectations given the SPIRE flux densities. 
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Full analysis of these surrounding SMGs will require additional follow-up efforts. 


2 Properties, Comparisons, Simulations 


2.1 Derivation of physical properties 


We briefly describe our procedures for calculating various physical quantities from observables 
below. To derive SFR, we measure 870-um flux density directly in the lower sideband (line-free 
bands) of our ALMA Band-7 observations from Cycle 4, finding consistent measurements with 
those found in previous shallower observations.'é We adopt an SFR-to-Sg;0 um ratio of 150 + 
50 Mo yr !/mJy, which is typical for SMGs.* The uncertainty in this ratio owes to variations 
in the dust temperature distribution amongst the SMG population, which are primarily driven 
by differences in the ratio of the luminosity absorbed by dust to the total dust mass."é This 
combined with the measurement error dictates the error on the SFR shown in Table 1 

Gas mass is calculated from the CO(4-3) line luminosity, which is converted to CO(1-0) 
luminosity using a ratio between the brightness temperatures of these lines r4; = 0.41 + 
0.07 found from the average of a sample of unlensed SMGs with multiple CO line transitions 
detected.^" We use a conservative conversion factor aco = TEES and multiply by 1.36 
to account for the addition of helium. When CO(4-3) is not significantly detected, we use our 
[CII] line luminosity and the average CO(4-3)/[CII] ratio for our detected sample; we denote 
these sources with asterisks in Table 1. 

The dynamical masses of galaxies are estimated using the following relation for a dispersion- 
dominated system, with C — 1.56 for a spherical distribution: 


Mayn(Mo) = C x 10902. R (1) 


where oc; is the line-of-sight velocity dispersion in km s^! calculated from the Gaussian fit to 
each line profile and R is the radius fit to the 345-GHz continuum of each source. Size for 
all sources are calculated by a 2D Gaussian fit to each source deconvolved with the naturally 
weighted synthesized beam (0.5” FWHM), although most are only marginally resolved in the 
image beyond the beam. For the smallest sources, where the error on the fit is not significant 
above the beam size, we adopt the 0.5” FWHM beam as an upper limit on the source size. Sizes 
for the sources are displayed in Extended Data Table 1 We use o, from the CO(4-3) profile 
for all sources except H, K, L, M, and N, where [CII] profile is used because CO(4-3) is not 
detected. 


2.2 Spectral energy distribution of SPT2349-56 


The SPT, LABOCA and SPIRE measurements resolve the SPT2349-56 structure to varying 
degrees, but none can isolate the core region resolved by our current ALMA observations with 
any confidence. We thus assemble a photometric catalog of the total SPT2349-56 flux density 
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from 250 um to 3mm and model the resulting total SED to estimate some global properties 
of the system. We do not include the SPT 1.4, 2.0, and 3.0 mm points because the source is 
elongated and flux measurements are difficult with the filtering used to make the map. At IRAC 
wavelengths in the mid-infrared, we detect 9 SMGs significantly and use these to determine a 
lower limit on the stellar mass assembled to date in SPT2349-56. 

We have used Code Investigating GALaxy Emission (CIGALE)^* ? for the SED fitting of 
the combined photometry of the source. The SED modelling assumes a single-component star 
formation history and solar metallicity.? A Chabrier?! IMF is assumed. The resulting best- 
fitting SED is shown in Extended Data Figure 7. The IR luminosity (from 8-1100 um) is (8.0 + 
1.0) x 10? Lo. The stellar mass inferred for this fit is highly uncertain, i.e., 9.5x 1011 E1.3x 10? 
solar masses. The best-fit SED corresponds to a stellar mass of 4.5 x 10!! solar masses, which 
is a lower limit on the stellar mass because not all the sources are detected in the IRAC bands. 
More optical-NIR photometry is needed to better constrain the total stellar mass of the system 


2.3 Protocluster comparison sample 


To place SPT2349-56 in context and compare to other systems claimed to be protoclusters, we 
assemble from the literature various SMG-rich overdensities at 2 < z < 5. Although a direct 
comparison of the number counts (number/deg?) of SMG-overdense systems can be performed, 
it involves making somewhat arbitrary choices of enclosed areas and redshift boundaries. We 
have opted in Figure 2 to instead show a curve of growth analysis of the 870-um flux density. 
Only galaxies confirmed to be protocluster members via spectroscopic redshifts are considered. 
The data are drawn from a recent compilation? and original references therein. 

The GOODS-N overdensity at z = 1.999555 spans a ~ 10' by 10’ field in the Hubble 
Deep Field North containing 9 SMGs in Az = 0.008. The probably of finding this large of 
an overdensity being drawn from the field distribution by chance is < 0.01%. Interestingly, 
only a modest overdensity of Lyman-break galaxies is found in this GOODS-N structure. The 
COSMOS z = 2.5 SMG overdensity? is similar to the GOODS-N structure in terms of the 
numbers and luminosities of the component SMGs, the angular size of the system, and the 
modest overdensity of LBGs associated with it. The MRC1138 overdensity was originally 
discovered as an overdensity of Lya and Ha emitters.?" Follow-up observations?*?? revealed 
the presence of 5 SMGs, in addition to an AGN known as the ‘Spiderweb galaxy’. This is 
a radio-loud AGN that resides in a large Ly-o halo. The SSA22 protocluster was one of the 
first discovered by observing an overdensity of LBGs.™ It is an extremely extended structure 
located at z = 3.09, with LAEs spanning greater than 50 comoving Mpc (cMpc).! Submm 
observations of the field have revealed a population of at least eight SMGs!953. 62.63.65 

The COSMOS z = 2.1 protocluster® lacks sufficiently deep 850-um data to characterize 
the Herschel-SPIRE sources identified in the structure. We estimate 870-um flux densities by 
taking their published Lin (integrated over 3-1100 um) and use the SED of Arp 220 to estimate 
S70 m, finding that Lir = 2 x 10? Locorresponds to $370 jm = 1 mJy at z ~ 2. For the SSA22 
protocluster, we use the measured 870-j.m flux density when available and otherwise estimate 
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it from the 1.1-mm flux using a standard conversion at z ~ 3 of Sg70 ym = 2x S115. To create 
the curves of growth for Figure 2, the centre of each protocluster is defined by computing the 
median RA and DEC of all submm sources. We checked that adjusting the centres of the curve 
of growth tracks randomly by ~ 1' did not boost the curves by more than 10%, demonstrating 
that the curves of growth for the literature SMG overdensities are insensitive to the adopted 
centre. 

Recently, there have also been detections of SMG overdensities at z > 4. The first, GN20, 
at z = 4.05, was discovered through the serendipitous detection of CO(4-3) from two SMGs,9 
with two further SMGs detected subsequently.$$ An excess of B-band dropouts is also observed 
in this structure, several of which are spectroscopically confirmed to lie at z ~ 4.05. HDF850.1 
contains a single SMG, a QSO, and 11 spectroscopically confirmed galaxies. The SMG has 
a confirmed redshift of z = 5.18. The AzTEC-3 overdensity is centred on a single SMG at 
z = 5.3, with 12 spectroscopically confirmed optical galaxies at the same redshift. This is a 
relatively dense structure, with most of the galaxies residing within a circle ~1’ in diameter. 
The most luminous example at z > 4, SMM J004224, was recently found from the Herschel 
surveys,” with several additional 870j:m sources in the field which the authors claim may be 
related to the central source by their over density (characterized as twice the over density of 
the blank field counts). In Fig. 2 we place all of their surrounding sources in the comparison, 
however on average about half of these sources are likely to be in the protocluster. 

Overdensities of SMGs and optical galaxies have also been found around high-redshift radio 
galaxies (HzRGs)," continuing to confirm HzRGs as useful beacons of structure forming in the 
early Universe. However none of these systems come close to the level of overdensity found 
in SPT2349-56, and furthermore, they suffer from the bias inherent in targeting these sources, 
namely, that one or more protocluster members have to be radio-luminous. 

There have also been discoveries of compact binary HyLIRG systems, the most luminous 
of which is the z = 2.4 source HATLAS J084933,"' with others approaching this luminos- 
ity.^- In each of these systems, the dynamics and SFRs are dominated by two SMGs, but 
there is no strong evidence of any surrounding protocluster in the form of an excess of galaxies 
selected optically or in the submm. In one case,” there is evidence for a relative void around 
the structure. These systems may simply be instances of very rare events in fairly typical (but 
still massive) halos,"^ analogous to hyper-luminous quasars.” 

Theoretical studies of N-body simulations have shown that the progenitors of z = 0 dark 
matter halos with masses > 10'° Mọ should extend over length-scales of > 5 cMpc at z > 
2.1228 Since the overdensities listed above are mostly concentrated in small areas, it is difficult 
to asses their exact evolution or compare them easily to simulated structures. Interpreting a 
small overdense region at high-redshift as a ‘protocluster core’ is certainly prone to misinter- 
pretation, and small overdense regions at high redshift can evolve into halos spanning a range 
of masses at the current epoch.'* These authors suggest investigating if an overdensity extends 
to larger scales (> 20 cMpc) to better determine whether it will form an M > 10!°Mocluster. 
However this is difficult at high redshift because the excess of galaxies will be less pronounced 
on larger scales, and it is challenging to detect high-redshift, low-luminosity galaxies. 
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2.4 Simulations 


To further place SPT2349-56 in context, we compare with the predictions of a theoretical 
model for SMG overdensities."^76 We employ the MultiDark”’ N-body simulation, which is 
one of the largest (2.91 Gpc?) available N-body simulations that still resolves SMG-like halos 
(Mhao Z 107? Mo). The z = 4.68 and z = 4.1 snapshots, which are the available snapshots 
closest in redshift to SPT2349-56, are analyzed. Halo catalogs were created using the Rockstar 
halo finder, and stellar masses are assigned to dark matter halos using a relation derived based 
on sub-halo abundance matching relation. To assign SFRs, it is assumed that the distribu- 
tion of specific SFR (SFR per unit stellar mass, hereafter SSFR) is the sum of two Gaussians, 
corresponding to quiescently star-forming and starburst galaxies. The median SSFR value is 
based on the abundance-matching-derived relation,?? and the starburst fraction and the widths of 
the Gaussian distributions are set based on observations of massive, high-redshift star-forming 
galaxies similar to the members of SPT2349-56.” Ma is estimated from stellar mass using em- 
pirical gas fraction and metallicity relations." Once SFR, M,, and Maust values are assigned to 
each halo, 5570 ım is calculated using the following fitting function, which was derived based on 
the results of performing dust radiative transfer on hydrodynamical simulations of both isolated 


and interacting galaxies; 8! 


SFR 0.43 M, 0.54 
m = 081 mJy ( ————À 2 
Sero x (x Mo —} (xs c) 2) 


where 5g79,,, is the 870-um flux density, SFR is the star formation rate, and Ma is the dust 
mass. Scatter of 0.13 dex is included when applying the relation. 

Once 5870 um has been assigned to each halo, we search the entire simulation volume for the 
most luminous regions. We begin at each independent halo and calculate the total S70 pm of 
all halos within a given radius r of this halo. For each value of r, we record the largest total 
S70 um Obtained (across all halos). One hundred Monte Carlo iterations are performed for each 
snapshot; in each iteration, galaxy properties are re-assigned, drawing from the distributions 
described above. The shaded region in Figure 2 shows the entire region spanned by the 100 
realizations of the maximum 579,4, VS. area curves. To compare to SPT2349-56 to lower 
redshift proto-clusters we preform a similar analysis on a snapshot at z — 2.49 with 20 Monte 
Carlo iterations. 
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Figure 1: The SPT2349-56 field and spectra of the constituent galaxies. (a) The LABOCA 
870-um contours of SPT2349-56 overlaid on the IRAC 3.6-um image; the 26” beam at 870 um 
is shown. Contours represent SNR = 3,7 and 9. The small circles show the locations of the 
14 protocluster sources. (b) ALMA band 7 imaging (276 GHz, 1.1 mm) displaying the 14 con- 
firmed protocluster sources, labeled A-N. Black (blue) contours denote the points 38% and 
50% of the peak flux for each source from the CO(4-3) ([CII]). The dashed black line shows 
where the primary beam is 50% the maximum. The filled blue ellipse shows the 0.4” natu- 
rally weighted synthesized beam. (c) CO(4-3) spectra (black lines) and [CH] spectra overlaid 
(shaded yellow bars) for all 14 sources centred at the biweight cluster redshift z = 4.304. The 
[CII] spectra are scaled down in flux by a factor of ten for clarity of presentation. The red arrows 
show the velocity offsets determined by fitting a Gaussian profile to the CO(4-3) spectra for all 
sources except for H, K, L, M, and N, for which we used [CH] because these are not detected 
in CO(4-3). 
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Figure 2: Comparison of SPT2349-56 to other cluster and proto-cluster systems. (a) The 
cumulative 870-um flux density vs. on-sky area for SPT2349-56, compared to other SMG-rich 
overdensities at high redshift (see Methods for details). The solid black line shows the ALMA- 
identified sources in SPT2349-56, while the dashed line includes the wider-field LABOCA- 
detected structure. The blue (green) shaded region denotes the maximum flux density vs. area 
curves obtained in 100 Monte Carlo realizations of a theoretical model for submm-luminous 
protoclusters at z — 4.5 (z — 2.5) based on an N-body simulation (see Methods for full details). 
Most of the literature SMG overdensities are consistent with the model expectations, whereas 
SPT2349-56 lies vastly above the region spanned by the model. A recently discovered z = 4 
protocluster from Herschel,” SMM J004224, is quite a unique system but > 10 times less dense 
than and likely only ~ 50% the total luminosity of SPT2349-56. (b) The cluster mass versus 
redshift is shown for SPT2349-56 and other massive galaxy clusters from the literature with 
detected ICM and well-defined masses. The colour scheme highlights the different methods for 
selecting massive clusters employed (brown = X-ray, blue = optical, green = Sunyaev-Zeldovich 
effect). Error bars represent the 1 o standard deviation. We also show the mean protocluster 
most-massive-progenitor mass vs. z relation predicted by N-body simulations. The location 
of SPT2349-56 in this plane suggests a very massive descendant (halo mass of 10 Mo at z = 
0), although we caution that the complex growth histories of dark matter halos make it difficult 
to reliably predict the z = 0 halo mass from the halo mass at a given epoch.”! 
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Table 1: Derived physical properties of SPT2349-56 protocluster members. 


Source Avi SFR Moas Máy, 
[km s^!] [Mo yr] [10 Mo] [10" Mg] 
A 137235 11704293 12.042.1 11.5 2.7 
B 107-31 12274307 11.242.0 84-20 
C 830 + 12 907 1227 6741.2 «14 
D 196 + 40 530 +140 484-1. 17.5 4.8 
E 312 +21 497 +141 48+0.9 <24 
F 623 + 82 505 +128 34+07 12.4063 
G —74 +37 409+ 103 1.6 + 0.4 < 12 
H —492 +28 310480 44-20! 44+1.1* 
I 537 + 78 268 +71 2.2 +40.5 « 53 
J —251 +35 2431067 2.2300. 2321029 
K 862 + 12 208 +54 314147 15+40.2* 
L —147 + 18 122+34 33+15' 24+0.5* 
M 261 +21 75+30 120.6) <0.4* 
N 319 + 25 64 + 25 1.0 + 0.57 < 0.9 * 


° Unresolved sources represent upper limits on the dynamical mass 
t Velocity offsets relative to z = 4.300 

* [CII] line used to derive Mgasas CO(4-3) is not detected 

* [CII] profile used to derive Mayn, otherwise CO(4-3) used 
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Extended Data Table 1: Observed properties of SPT2349-56 protocluster members 


Source | RA (J2000) Dec (J2000) $1090 pm S370 um 53.6 um S45 um CO(4-3) f Sdv CO(4-3) FWHM [CII] f Sdv [CII] FWHM Size ! 
[h:m:s] [d:m:s] [mJy] [mJy] [uJy] [uJy] [Jy km s~!] [km s-!] [Jy km s^] [km s-!] [kpc] 

A 23:49:42.67 —56:38:19.3. 4.63 +0.04 78401 43405 5513 0.99 + 0.03 376 + 46 8.81 + 0.26 354 + 30 52x «2.8 
B 23:49:42.79 —56:38:24.0. 4.35 +0.04 8:20. 6.4 0.3 - 0.92 + 0.03 341 + 38 7.53 + 0.22 314 + 28 3.2 x 3.1 
C 23:49:42.84 —56:38:25.1 2.69+0.04 6.00.1 19.6+40.8 18.0 + 2.8 0.55 + 0.02 154 + 13 4.43 + 0.17 160 + 10 <2.8 x <2.8 
D 23:49:41.42 —56:38:22.6 2.20+0.08 35403 34405 4842.2 0.69 + 0.04 485 + 64 3.62 +0.78 | 346 + 129 41x «2.8 
E 23:49:41.23. —56:38244 2.12+0.11 33404 68407 7.12.1 0.39 + 0.02 199 + 23 3.47 +1.24 3104137 <28x <28 
F 23:49:42.14 —56:38:25.8 1.69 +0.05 34+0. 2.0 + 0.4 - 0.28 + 0.03 396 + 103 4.28 + 0.35 353 + 35 4.5 x 2.9 
G 23:49:42.74 —56:38:225.1 1.11 +0.04 2.740. 1.6 + 0.4 - 0.14 + 0.02 147 +41 - - <2.8 x <2.8 
H 23:49:43.46 —56:38:26.2 0.85 +0.05 2.140. 132402 - - - 3.63 + 0.30 236 + 31 3.9 x 3.7 
I 23:49:42.22 —56:38:28.3 0.78 +0.05 1.8+0. 12x02 - 0.18 + 0.03 277 X 90 3.18 + 0.32 236 + 24 3.1 x « 2.8 
J 23:49:43.22 —56:38:30.1 0.61 +0.06 1640.2 49405 69420 0.19 + 0.02 151 + 38 3.79 + 0.29 138 + 15 6.8 x 4.1 
K 23:49:42.96 —56:38:17.9 0.34 +0.04 L.4 € 0. 36406 5241.6 - - 2.54 + 0.17 129 + 12 52x43 
L 23:49:42.38 —56:38:25.8 023 +0.04 0.8 € 0. 39405 541.8 - - 2.78 + 0.20 176 + 20 4.1 x 2.9 
M 23:49:43.39 —56:38:21.1 0.21 £0.05 0540.2 38405 S1+1.6 - - 1.04 + 0.14 87 + 23 - 
N 23:49:43.27 —56:38:229 0.18 +0.04 0.4 € 0. 34406 461.7 - - 0.86 + 0.16 128 + 26 - 


+ Major and minor axis FWHM source sizes after de-convolution with a 0.5-arcsec Gaussian beam. Sizes are converted from arcsec to kpc 
assuming an angular diameter distance of 6.9 kpc per arcsec at z = 4.3. The typical uncertainty in the quoted sizes is 1.5 kpc. Sources with a 


de-convolved size less than 0.4 arcsec (2.8 kpc) are considered unresolved. 


Extended Data Table 2: Properties of the 3 ATCA CO(2-1) sources 


ATCA source | ALMA ID J Sdv Ov L'(CO 2-1) Meas 
[Jy km s] [kms !] 10! [K km s™t pe?) [10! Mo] 


Central (C) B,C, G 0.69 + 0.076 372 + 47 1.22 + 0.12 1.33 + 0.15 
West (W) D,E 0.16 + 0.04 166 + 47 0.29 0.07 0.32 + 0.08 
North (N) A, K 0.085 + 0.0028 175 + 68 0.15 + 0.05 0.16 + 0.05 
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Extended Data Table 3: Observed properties of all red (Ss00ym > S350pm > S2504m) 
SPIRE sources in the field surrounding SPT2349-56. The LABOCA sources corresponding 
to SPT2349-56 are listed first, and the red SPIRE sources in the surrounding field follow. AII 
sources listed are highlighted in Extended Data Figure 5. 


RA (32000) Dec (J2000) S550 um S350 um S500 um S850 um d! 
[h:m:s] [d:m:s] [mJy] [mJy] [mJy] [mJy] [arcmin] 

23:49:42 —56:38:25 4543 7153 9643 770+42.9 - 
23:49:43 —56:37:31 2143 3743 4343 250+2.8 0.9 
23:49:25 —56:35:27 2344 2644 3244 2941.7 5.2 
23:49:39 —56:36:33 1243 1643 2344 39:13 2.1 
23:49:36 —56:41:17 7+3 1443 1943 3.21.6 3.2 
23:49:55 —56:34:17 6+4 1043 2045 48+1.8 5.3 
23:49:12 —56:40:31 1145 1645 22+5 68+2.6 7.7 


* Distance from central SPT2349-56 source. 
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Extended Data Figure 1: Herschel-SPIRE image. A RGB scale is used to represent 500, 350, 
and 250 um, with the red SPT2349-56 extended complex clearly visible in a relative void in the 
foreground z ~ 1 cosmic infrared background (blue to green-coloured galaxies). 
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Extended Data Figure 2: Wide field 870- jm image and photometry. A wide-field LABOCA 
image (21” beam size) of SPT2349-56. The image rms noise is 1.3 mJy at center, increasing 
to 2mJy at the edges of the region shown. The total flux density recovered is 110.0+9.5 mJy. 
Sub-apertures are drawn showing three different regions and their recovered flux densities. SPT 
1.4-mm contours are also shown (blue), revealing that even with the 1’ beam of SPT, SPT2349- 
56 is resolved. One additional submm source is detected at > 5o in the LABOCA image to the 
east of the primary source, though Herschel-SPIRE photometry indicates that it is unlikely to 
be at z ~ 4. 
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Extended Data Figure 3: CO(2-1) observations of SPT2349-56. (a): The colormap and red 
contours trace the CO(2-1) line integrated over the central 830 km s !, with contours spaced 
by 3c starting at 3c. The grey contours show the 1.1-mm ALMA continuum detections. (b): 
One-dimensional spectra extracted at the positions indicated in (a). 
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Extended Data Figure 4: IRAC observations of SPT2349-56. Circles display the location 
of the 14 sources detected in ALMA band 7 described above. Nine sources are detected with 
IRAC, including the two faintest [CIT] sources from the blind line survey. 
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Extended Data Figure 5: SPIRE RGB image and source colours in field surrounding 
SPT2349-56. (a) Deep SPIRE false colour image is shown with LABOCA contours over- 
laid. Locations of 250-um peaks used for analysis are marked with crosses (the faintest are not 
visible because of the contrast adopted in the image). Colour-colour (CC, (b)) and colour-flux 
(CF, (c)) diagrams for 250 jum sources are also shown. Error bars represent the 1 o standard de- 
viation. The CC diagram shows sources with SNR(250 jum)»3 and is dominated by the z ~ 1 
cosmic infrared background in the foreground of SPT2349-56 (sources with colours ranging 
from blue to green). The CF diagram applies an additional SNR(500 um)>3 cut. The CC and 
CF diagrams show that one of three peaks associated with SPT2349-56 is likely a lower-redshift 
interloper (green symbol), but also that there are five additional sources (blue symbols) in the 
surrounding 2 Mpc region with colours (S509 pm > 350m > 5250ym) that are suggestive of 
224.3. 
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Extended Data Figure 6: Line-free 870-jjm continuum image. Contours represent 1090 um 
and begin at 0.15 mJy (SNR- 5) and increase in steps of 2 mJy. The half-power beam widths 
are also shown for the 870-um observations (long dashed line — 18") and 1.1-mm observations 
(short dashed line — 23"). We find good agreement between the two wavelengths, with all 
sources A-L detected in both images. Note that neither the 870-um image nor the 1090-jm 
contours are corrected for the primary beam, thus sources away from the center, especially D & 
E, appear fainter than they are intrinsically. 
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Best model for SPT2349-56 at z = 4.304. Reduced x?=0.48 
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Extended Data Figure 7: Spectral energy distribution of SPT2349-56. The SED of the 
extended SPT2349-56 source is shown, including the summed deconvolved Herschel-SPIRE 
flux densities, the total 870-um LABOCA flux density, and the summed IRAC flux densities. 
We do not include the SPT 1.4, 2.0, and 3.0 mm points because the source is elongated and flux 
measurements are difficult with the filtering used to make the map. Fitting the SED yields an 
IR luminosity of (8.0 + 1.0) x 10? Lo. 
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